ABSTRACT

A DIFFERENTIAL FOR L-STATISTICS

Let $X_{1n} \le X_{2n} \le \dots \le X_{nn}$ be an ordered sample from a distribution F and
$c_{in}$ a sequence of constants. Statistics of the form $\sum_{i=1}^{n} c_{in} X_{in}$ are called
"linear functions of order statistics," "L-estimators," or simply "L-statistics."
Various methods of generating the constants $c_{in}$ have been considered, including
$c_{in} = n^{-1} J(i/(n+1))$ or $n^{-1} J(i/n)$ and $c_{in} = \int_{(i-1)/n}^{i/n} J(u)\,du$ for fixed "score" functions
J. L-statistics of the form $T_n = \sum_{i=1}^{n} c_{in} X_{in}$ can be obtained from the
functional $T(F) = \int F^{-1}(t) J(t)\,dt$ by substitution of the sample d.f. $F_n$ for F.
Under the mild assumption that J is bounded and continuous a.e. Lebesgue and
a.e. $F^{-1}$, and under a tail restriction on F of the form $\int q(F(x))\,dx < \infty$
(e.g., $q(t) = [t(1-t)]^{1/2-\delta}$, $0 < \delta < 1/2$), it is shown that $T(\cdot)$ has a Frechet-type
differential. The tail restriction may be dropped if J trims the extremes.
In either case it follows that if $\{X_i\}$ is a sequence of independent observations
on F, then $\sqrt{n}\,(T(F_n) - T(F))$ is asymptotically normal and obeys a law of the
iterated logarithm. Continuity of T holds under milder conditions on J and
F. This leads to strong consistency, $T(F_n) \to T(F)$. No continuity restrictions
are imposed on F, so that the results are applicable to a wide class of
distributions of interest in robust estimation. Illustration is provided by
examples including the trimmed mean, the smoothly trimmed mean, and approximations
to the interquartile range. The asymptotic normality result is competitive
with one of Stigler (1974) for the closely related statistic
$S_n = \sum_{i=1}^{n} c_{in} X_{in}$, obtained under stronger conditions on J but a slightly
milder condition on F. However, in addition to asymptotic normality of $T(F_n)$,
the differential approach of the present paper yields characterization of
the almost sure behavior of $T(F_n)$ and lends itself to straightforward extension
to the case of dependent variables.


1. INTRODUCTION

Let $X_{1n} \le X_{2n} \le \dots \le X_{nn}$ be an ordered sample from a distribution F and
$c_{in}$ a sequence of constants. Statistics of the form $\sum_{i=1}^{n} c_{in} X_{in}$ are called
"linear functions of order statistics" (e.g., Shorack (1972), Stigler (1974)),
"L-estimators" (Huber (1972)), or simply "L-statistics". Various methods of
generating the constants $c_{in}$ have been considered, including $c_{in} = n^{-1} J(i/n)$
or $n^{-1} J(i/(n+1))$ and $c_{in} = \int_{(i-1)/n}^{i/n} J(u)\,du$ for fixed "score" functions J. By
choosing the $c_{in}$ (or J) properly, the L-estimator can be made insensitive to
outliers or "robust" with regard to long-tailed distributions. Thus L-estimators
have played an important role in the development of the modern theory of robust
estimation. Since their exact sampling distributions are difficult to compute
even under strict model assumptions, much work has focused upon the question
of asymptotic normality. There appear to be four main approaches: (1) weak
convergence in connection with the empirical process, Bickel (1967), Shorack
(1969, 1972); (2) Hajek's projection method, Stigler (1969, 1974); (3)
approximation of the L-estimator by sums of independent exponential r.v.'s,
Chernoff, Gastwirth, and Johns (1967); (4) Frechet differentials, Gregory (1976).
The present paper treats L-statistics of the form $T_n = \sum_{i=1}^{n} c_{in} X_{in}$ and
uses a differential method similar to Gregory's, but the restrictions on the
$c_{in}$ and the underlying d.f. F are similar in spirit to Stigler (1974), who
treats the closely related statistic $S_n = \sum_{i=1}^{n} c_{in} X_{in}$. Further, in addition
to asymptotic normality, the main theorems of Sections 4 and 5 yield strong
consistency and a law of the iterated logarithm (LIL). More general versions
of these last two results have been given recently by Wellner (1977a), (1977b).

The L-statistics we consider may be represented in terms of a functional
defined on the space of d.f.'s. The basic functional of interest is given by

(1.1)    $T(F) = \int_0^1 F^{-1}(t)\,dK(t)$,

where $F^{-1}(t) = \inf\{x: F(x) \ge t\}$ and K(t) is a right continuous function of
bounded variation on [0,1]. In this paper we restrict K(t) to be absolutely
continuous by putting $K(t) = \int_0^t J(u)\,du$, for J integrable on [0,1]. Then (1.1)
becomes

(1.2)    $T(F) = \int_0^1 F^{-1}(t) J(t)\,dt$.

(Note that finite linear combinations of quantiles are thus excluded. However,
in Example (iii) of Section 4 we show how to approximate such combinations by
appropriate choices of J.) For the case that K(t) is a d.f. symmetric about
1/2, Bickel and Lehmann (1976, I and II) discuss the value of (1.1) as a measure
of location. We are allowing K(t) to represent a signed measure in order to
include a large class of scale functionals as well.

The sample d.f. $F_n$ generated by a sample $X_1, \dots, X_n$ is defined by

    $F_n(x) = n^{-1} \sum_{i=1}^{n} I(X_i \le x)$.

For estimation purposes we set $T_n = T(F_n)$ by substitution of $F_n$ for F in (1.1).
This statistic can be put in a more familiar form by noting that

    $\int_0^1 F^{-1}(t)\,dK(t) = \int_{-\infty}^{\infty} x\,dK(F(x))$

for any d.f. in the domain of $T(\cdot)$ (the change of variable is justified by
Lemma 12, Section 3). Then substitution of $F_n$ for F in this last expression
yields

(1.3)    $T(F_n) = \int x\,dK(F_n(x)) = \int x\,d\left[\int_0^{F_n(x)} J(u)\,du\right] = \sum_{i=1}^{n} X_{in} \int_{(i-1)/n}^{i/n} J(u)\,du$,

i.e., $T(F_n) = T_n$.
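
As an illustration of (1.3), the following is a minimal Python sketch, not part of the original paper. It assumes that K is supplied as a function, so that the weight on the i-th order statistic is K(i/n) - K((i-1)/n). The helper names `l_statistic` and `trimmed_mean_K` and the data are hypothetical; the trimmed-mean score J(u) = 1/(1-2a) on [a, 1-a] is one of the examples named in the abstract.

```python
def l_statistic(sample, K):
    """T(F_n) = sum_i X_(i) * [K(i/n) - K((i-1)/n)], following (1.3)."""
    xs = sorted(sample)
    n = len(xs)
    return sum(x * (K(i / n) - K((i - 1) / n)) for i, x in enumerate(xs, start=1))


def trimmed_mean_K(alpha):
    """K(t) = int_0^t J(u) du for J(u) = 1/(1-2*alpha) on [alpha, 1-alpha], 0 elsewhere."""
    def K(t):
        clipped = min(max(t, alpha), 1.0 - alpha)
        return (clipped - alpha) / (1.0 - 2.0 * alpha)
    return K


# Hypothetical data with two gross outliers.
data = [2.0, 100.0, 4.0, 3.0, 1.0, 5.0, -50.0, 6.0]

mean_value = l_statistic(data, lambda t: t)             # J = 1 gives the sample mean
trimmed_value = l_statistic(data, trimmed_mean_K(0.25))  # 25% trimmed mean
print(mean_value, trimmed_value)
```

With these data the untrimmed choice J = 1 reproduces the sample mean 8.875, while the trimmed score puts weight 1/4 on each of the four central order statistics and gives 3.5, which shows the insensitivity to outliers discussed above.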
In Sections 2 and 3 some preliminaries are developed. Section 2 provides
the definitions of the differential and of continuity, the basic tools to be
utilized in the sequel, and important convergence results for $\|F_n - F\|$. Of
interest here are recent results by O'Reilly (1974), James (1975), and Wellner
(1977a) for weighted empirical processes. Statistical application is provided
by Theorem 1 (strong consistency), Theorem 2 (asymptotic normality), and
Theorem 3 (LIL). Section 3 provides a useful representation for the difference
T(F) - T(G) and a related inequality.

In Sections 4 and 5 we will prove that under suitable restrictions on
J, F, and the sequence $\{X_i\}$, we have

(1.4)    $\lim_{n\to\infty} T(F_n) = T(F)$ w.p.1;

(1.5)    $\sqrt{n}\,(T(F_n) - T(F)) \xrightarrow{d} N(0, \sigma^2)$;

(1.6)    $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,(T(F_n) - T(F))}{\sqrt{2\sigma^2 \log\log n}} = 1$ w.p.1.

Continuity of T leads to (1.4), and the existence of a differential for
T leads to (1.5) and (1.6). For proving continuity we assume that J is
bounded and integrable on [0,1]. For proving the existence of a differential
we assume that J is bounded and continuous a.e. Lebesgue and a.e. $F^{-1}$. In
Section 4 we further restrict J to be 0 in some neighborhoods of 0 and 1,
making T robust in the Hampel (1971) sense. Theorem 4 provides the existence
of a differential in this case w.r.t. the sup-norm $\|\cdot\|_\infty$. A corollary
yields (1.5) and (1.6) for I.I.D. r.v.'s by appeal to Theorems 2 and 3.
Theorem 5 yields continuity of T w.r.t. $\|\cdot\|_\infty$ and hence (1.4). In Section 5
we trade the trimming condition on J for a tail restriction on F of the form
$\int q(F(x))\,dx < \infty$ (e.g., $q(t) = (t(1-t))^{1/2-\delta}$, $0 < \delta < 1/2$). Theorem 6 then provides
the existence of a differential w.r.t. the q-norm $\|\cdot\|_{q(F)}$, essentially
defined by $\|\Delta\|_{q(F)} = \|\Delta/(q \circ F)\|_\infty$. Corollaries 1 and 2 then yield (1.5)
and (1.6) in the I.I.D. case for different classes of q functions (tail
restrictions). Continuity and (1.4) are provided by Theorem 7.

Section 6 generalizes the results of Sections 4 and 5 to the functional
$T(F) = \int_0^1 h(F^{-1}(t))\,dK(t)$. In Section 7 specific comparisons are made with
related work of Stigler (1974), Gregory (1976), and Wellner (1977a), (1977b).

2.  THE  DIFFERENTIAL  AND  ITS  STATISTICAL  APPLICATIONS 

Most of this section can be found in expanded form in Boos and Serfling
(1977). We begin with the definition of the differential and motivation for
its use. Lemmas 1-4 spell out the statistical properties of the differential
and Lemma 5 relates continuity of functionals to consistency of $T(F_n)$. A
discussion of specific norms follows, including Lemmas 6-11 which provide
some convergence properties of $\|F_n - F\|$ for these norms. Lastly, Theorems
1, 2, and 3 provide (1.4), (1.5), and (1.6) for the usual situation of I.I.D.
r.v.'s.

Let T be a real-valued functional defined on a convex set $\mathcal{F}$ of d.f.'s.
Denote by $\mathcal{D}(\mathcal{F})$ the linear space generated by differences H-G of members of
$\mathcal{F}$, i.e.,

    $\mathcal{D}(\mathcal{F}) = \{\Delta: \Delta = a(H-G),\ H, G \in \mathcal{F},\ a \in R\}$.

Assume $\mathcal{D}(\mathcal{F})$ is equipped with a norm $\|\cdot\|$, to be specified later.

DEFINITION. We say that a functional T defined on $\mathcal{F}$ has a differential
at the point $F \in \mathcal{F}$ w.r.t. the norm $\|\cdot\|$ and the set $\mathcal{G}_F \subset \mathcal{F}$ if there exists
a quantity $T(F; \Delta)$ defined on $\Delta \in \mathcal{D}(\mathcal{F})$, which is linear in the argument $\Delta$
and satisfies the condition

(2.1)    $\lim_{\|G-F\| \to 0,\ G \in \mathcal{G}_F} \dfrac{T(G) - T(F) - T(F; G-F)}{\|G-F\|} = 0$.

$T(F; \Delta)$ is called the "differential." By linearity of $T(F; \Delta)$ is meant that

    $T(F; \sum_{i=1}^{k} a_i \Delta_i) = \sum_{i=1}^{k} a_i T(F; \Delta_i)$

for $\Delta_1, \dots, \Delta_k \in \mathcal{D}(\mathcal{F})$ and real $a_1, \dots, a_k$. Often $\mathcal{G}_F = \mathcal{F}$, although sometimes
(2.1) is easier to show for special choices of $\mathcal{G}_F$. When mention of $\mathcal{G}_F$ is
omitted, it will be assumed that $\mathcal{G}_F = \mathcal{F}$.

The intuitive content of the above definition is that T(G) - T(F) can
be closely approximated by the differential T(F; G-F), whose linearity property
can then be exploited. An alternative statement of (2.1) is

(2.2)    $T(G) - T(F) = T(F; G-F) + o(\|G-F\|)$ as $\|G-F\| \to 0$, $G \in \mathcal{G}_F$.

For statistical applications we substitute the sample d.f. $F_n$ for G and find
that $T(F_n) - T(F)$ is approximated closely by $T(F; F_n - F)$ in a stochastic
sense (to be made clear in the lemmas below). In order to examine $T(F; F_n - F)$,
recall that for a sample $X_1, \dots, X_n$, the sample d.f. may be written as
$F_n = n^{-1} \sum_{i=1}^{n} \delta_{X_i}$, where $\delta_x$ denotes the d.f. of a r.v. degenerate at x. Then,
by the linearity of the differential,

    $T(F; F_n - F) = T(F; \frac{1}{n} \sum_{i=1}^{n} (\delta_{X_i} - F)) = \frac{1}{n} \sum_{i=1}^{n} T(F; \delta_{X_i} - F)$.

We assume throughout this paper that $T(F; \delta_X - F)$ is well-defined and measurable
w.r.t. the probability space induced by X. For convenience we set $T(F; \delta_x - F) =
T[F; x]$ and note that $T[F; x] = IC_{T,F}(x)$, the influence curve of T at F (see
Hampel (1974)). (In Boos and Serfling (1977), $T[F; x] = T(F; \delta_x - F) -
\int T(F; \delta_x - F)\,dF(x)$; this latter expectation is 0 for L-statistics.) Thus we
see that $T(F; F_n - F)$ reduces to an average of identically distributed r.v.'s,
$T[F; X_i]$, which are independent if the original sample $X_1, \dots, X_n$ consists of
independent r.v.'s.

Before pursuing the statistical applications of the differential, we formulate
the related concept of continuity of functionals.

DEFINITION. The functional T is said to be continuous at F w.r.t.
$\|\cdot\|$ and $\mathcal{G}_F$ if

(2.3)    $\lim_{\|G-F\| \to 0,\ G \in \mathcal{G}_F} T(G) = T(F)$.

In order to facilitate the statement of a number of short lemmas, we list
here a group of conditions. The sample d.f. $F_n$ will always be assumed to be
generated by a sample (not necessarily independent) $X_1, \dots, X_n$ from a
distribution F.

CONDITIONS.

(2.4)    T has a differential at F w.r.t. $\|\cdot\|$ and $\mathcal{G}_F$;

(2.5)    $P\{F_n \in \mathcal{G}_F \text{ for } n \text{ sufficiently large}\} = 1$;

(2.6)    $\lim_{n\to\infty} \|F_n - F\| = 0$ w.p.1;

(2.7)    $\sqrt{n}\,\|F_n - F\| = O_p(1)$ as $n \to \infty$;

(2.9)    $E_F\{T[F;X]\} = 0$, $\mathrm{Var}_F\{T[F;X]\} = \sigma^2 > 0$;

(2.10)   $X_1, \dots, X_n$ are independent and identically distributed with d.f. F.

REMARKS. (i) For some applications, conditions (2.5) and (2.6) may be
replaced by weaker versions using convergence in probability. (ii) Condition
(2.7) is equivalent to the condition that the sequence of distributions
corresponding to $\sqrt{n}\,\|F_n - F\|$ is tight. (iii) Conditions (2.9) and (2.10)
imply the classical central limit theorem

(2.11)   $\frac{1}{\sqrt{n}} \sum_{i=1}^{n} T[F;X_i] \xrightarrow{d} N(0, \sigma^2)$ as $n \to \infty$,

and the classical law of the iterated logarithm

(2.12)   $\limsup_{n\to\infty} \dfrac{\sum_{i=1}^{n} T[F;X_i]}{\sqrt{2\sigma^2 n \log\log n}} = 1$ w.p.1. □

The following lemmas provide the foundation for using the differential and
the related concept of continuity in statistics. The proofs are trivial and
will be omitted for all but Lemma 4.

LEMMA 1. If (2.4), (2.5), and (2.6) hold, then

    $\lim_{n\to\infty} \dfrac{T(F_n) - T(F) - T(F; F_n - F)}{\|F_n - F\|} = 0$ w.p.1,

or equivalently

    $T(F_n) - T(F) = \frac{1}{n} \sum_{i=1}^{n} T[F;X_i] + o(\|F_n - F\|)$ as $n \to \infty$ w.p.1.

LEMMA 2. If (2.4), (2.5), and (2.7) hold, then

    $\sqrt{n}\,(T(F_n) - T(F) - T(F; F_n - F)) \xrightarrow{p} 0$ as $n \to \infty$.

LEMMA 3. If (2.4), (2.5), (2.7), (2.9), and (2.10) hold, then

    $\sqrt{n}\,(T(F_n) - T(F)) \xrightarrow{d} N(0, \sigma^2)$ as $n \to \infty$.

(Note that conditions (2.9) and (2.10) are used here only to get (2.11).)

LEMMA 4. If (2.4), (2.5), (2.8), (2.9), and (2.10) hold, then

    $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,(T(F_n) - T(F))}{\sqrt{2\sigma^2 \log\log n}} = 1$ w.p.1.

(Note that condition (2.10) is needed here only to get (2.12).)
PROOF. Write $T(F_n) - T(F)$ as

    $T(F_n) - T(F) = [T(F_n) - T(F) - T(F; F_n - F)] + T(F; F_n - F)$.

By (2.12),

    $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,T(F; F_n - F)}{\sqrt{2\sigma^2 \log\log n}} = 1$ w.p.1.

Thus it suffices to show that

    $\lim_{n\to\infty} \dfrac{\sqrt{n}\,[T(F_n) - T(F) - T(F; F_n - F)]}{\sqrt{2\sigma^2 \log\log n}} = 0$ w.p.1.

Write this last expression as

    $\dfrac{T(F_n) - T(F) - T(F; F_n - F)}{\|F_n - F\|} \cdot \dfrac{\sqrt{n}\,\|F_n - F\|}{\sqrt{2\sigma^2 \log\log n}}$.

The first term converges to 0 w.p.1 by Lemma 1, and the second term is
bounded w.p.1 by (2.8). □

LEMMA 5. If (2.3), (2.5), and (2.6) hold, then

    $\lim_{n\to\infty} T(F_n) = T(F)$ w.p.1.

The general statistical application of the differential and of continuity
is well characterized by the preceding five lemmas. It should be clear how to
extend Lemmas 3 and 4 to the case of dependent variables. After presenting a
class of norms which satisfy (2.6) - (2.8), we will apply the above general
results to get Theorems 1, 2, and 3 which provide strong consistency, asymptotic
normality, and an LIL for $T(F_n)$ in the case of I.I.D. r.v.'s.

The statistical value of the theory developed in this section depends
heavily on the choice of norm $\|\cdot\|$. In fact, the norm must serve two somewhat
conflicting purposes. For satisfaction of (2.1) of the definition of
the differential, we would like a relatively "large" norm, whereas conditions
(2.6)-(2.8) are most easily satisfied for "small" norms. We now introduce
a class of norms for which a number of useful stochastic results are available.
Let F be a fixed d.f. and let the closure of $(x_1, x_2)$ be the smallest
interval (possibly infinite) containing the support $S_F$ of F. Let $\mathcal{F} =
\{\text{d.f. } G: S_G \subset S_F\}$, and let $\mathcal{D}(\mathcal{F})$ be the companion linear space of differences.
For a bounded positive function q on (0,1), we define

(2.13)   $\|\Delta\|_{q(F)} = \sup_{x_1 < x < x_2} \left| \dfrac{\Delta(x)}{q(F(x))} \right|$,  $\Delta \in \mathcal{D}(\mathcal{F})$.

For $q(t) \equiv 1$ we get the usual sup-norm, $\|\Delta\|_\infty = \sup_{-\infty < x < \infty} |\Delta(x)|$, since on $\mathcal{D}(\mathcal{F})$
we have $\sup_{x_1 < x < x_2} |\Delta(x)| = \sup_{-\infty < x < \infty} |\Delta(x)|$.
The important choices of q function are those for which $q(t) \to 0$ as $t \to 0$
and 1. They produce nonequivalent norms which are "larger" than $\|\cdot\|_\infty$, for

(2.14)   $\|\Delta\|_\infty \le \|q\|_\infty \|\Delta\|_{q(F)}$,  $\Delta \in \mathcal{D}(\mathcal{F})$.

The potential use of such norms in verifying (2.1) can be seen from the
following inequality. Let $S_F = (-\infty, \infty)$ and $\int_{-\infty}^{\infty} q(F(x))\,dx < \infty$. Then

    $\left| \int_{-\infty}^{\infty} \Delta(x)\,dx \right| = \left| \int_{-\infty}^{\infty} \left[ \dfrac{\Delta(x)}{q(F(x))} \right] q(F(x))\,dx \right| \le \|\Delta\|_{q(F)} \int_{-\infty}^{\infty} q(F(x))\,dx$.

A similar inequality will be employed in Section 5 to show that L-functionals
have a differential w.r.t. $\|\cdot\|_{q(F)}$.
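
To make (2.13), (2.14), and the integral bound concrete, here is a small numerical sketch in Python. It is an illustration only: the choice of F as the uniform (0,1) d.f., the comparison d.f. G(x) = x^2 on the same support, the value delta = 0.25, and the midpoint grid are all assumptions made for this example, and the suprema are approximated on the grid rather than computed exactly.

```python
def q(t, delta=0.25):
    """Weight q(t) = (t(1-t))^(1/2 - delta), 0 < delta < 1/2."""
    return (t * (1.0 - t)) ** (0.5 - delta)


F = lambda x: x          # uniform (0,1) d.f. on its support (assumed)
G = lambda x: x * x      # a second d.f. with the same support (hypothetical)

m = 20000
grid = [(k + 0.5) / m for k in range(m)]   # midpoints of (0, 1)
h = 1.0 / m

sup_norm = max(abs(G(x) - F(x)) for x in grid)            # ||G - F||_inf
q_norm = max(abs(G(x) - F(x)) / q(F(x)) for x in grid)    # ||G - F||_q(F), as in (2.13)
q_sup = max(q(t) for t in grid)                           # ||q||_inf
int_delta = abs(sum(G(x) - F(x) for x in grid) * h)       # |int (G - F) dx|
int_q = sum(q(F(x)) for x in grid) * h                    # int q(F(x)) dx

print(sup_norm <= q_sup * q_norm + 1e-12)   # inequality (2.14), up to rounding
print(int_delta <= q_norm * int_q)          # the integral bound
```

In this particular example (2.14) holds with near equality, because the difference G - F and the weight q(F) are both maximized at the same point; the integral bound holds with room to spare.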

Even though $\|\cdot\|_\infty$ may be viewed as a member of the larger class of q-norms,
it is advantageous to consider $\|\cdot\|_\infty$ itself. Historically, results related
to (2.6)-(2.8) for $\|\cdot\|_\infty$ generally preceded the results for $\|\cdot\|_{q(F)}$. For
sequences of independent and identically distributed r.v.'s, condition (2.6)
with $\|\cdot\| = \|\cdot\|_\infty$ is just the Glivenko-Cantelli Theorem. The conclusion
of the following lemma implies (2.7) for $\|\cdot\| = \|\cdot\|_\infty$.

LEMMA 6. Let $\{X_i\}$ be a sequence of independent observations on a
nondegenerate distribution F. Then

(2.15)   $\sqrt{n}\,\|F_n - F\|_\infty \xrightarrow{d} Z_F$ as $n \to \infty$,

where $Z_F$ is positive w.p.1.

The proof of (2.15) for the case of F continuous was first given by
Kolmogorov (1933). The distribution of $Z_F$ was given explicitly and seen not
to depend on F. Extension to the case of F having finitely many discontinuities and
not being purely atomic was obtained by Schmid (1958). Here the distribution of
$Z_F$ was given explicitly; it depends upon F in the case of discontinuities. The
general case is treated in Billingsley (1968), Section 16. See Boos and Serfling
(1977) for further discussion.

The conclusion of the next lemma yields (2.8) for $\|\cdot\| = \|\cdot\|_\infty$.

LEMMA 7. Let $\{X_i\}$ be a sequence of independent observations on a
distribution F. Then

(2.16)   $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,\|F_n - F\|_\infty}{\sqrt{\log\log n}} = \sup_{-\infty < x < \infty} \sqrt{2F(x)(1-F(x))}$ w.p.1.

The proof of (2.16) in the case of F continuous was given by Chung (1949).
Extension to the case of F having discontinuities is due to Richter (1974).

Now we turn to the norm $\|\cdot\|_{q(F)}$. Most of the results in the literature
concerning $\|F_n - F\|_{q(F)}$ are for F continuous. The following lemma shows how to
use such results for the purpose of verifying (2.6)-(2.8) for arbitrary F. Let
H denote the d.f. of uniform (0,1) r.v.'s.

LEMMA 8. Let $\{X_i\}$ be a sequence of independent r.v.'s on a given probability
space, having distribution F. Then a sequence $\{U_i\}$ of independent uniform
(0,1) r.v.'s, with sample d.f. $H_n$, can be constructed such that

(2.17)   $\|F_n - F\|_{q(F)} \le \|H_n - H\|_{q(H)}$ w.p.1, all n.

PROOF. For the given sequence $\{X_i\}$, defined on an arbitrary probability
space, it is possible by means of randomization to construct uniform (0,1)
r.v.'s $\{U_i\}$ such that

    $X_i = F^{-1}(U_i)$ w.p.1.

This is well-known, but a construction is provided in Boos (1977). Let $H_n$ be
the sample d.f. of the constructed sequence $\{U_i\}$ and $F_n$ the sample d.f. of the
sequence $\{X_i\}$. Then

    $F_n(x) = H_n(F(x))$ w.p.1,

and

    $\sup_{x_1 < x < x_2} \left| \dfrac{F_n(x) - F(x)}{q(F(x))} \right| = \sup_{x_1 < x < x_2} \left| \dfrac{H_n(F(x)) - F(x)}{q(F(x))} \right| \le \sup_{0 < t < 1} \left| \dfrac{H_n(t) - H(t)}{q(t)} \right|$ w.p.1,

with equality if F is continuous. □

The bound (2.17) aids us in the following way. Suppose that condition
(2.6), (2.7), or (2.8) holds for sequences of independent uniform (0,1) r.v.'s.
Then for arbitrary sequences $\{X_i\}$, the conclusion of the lemma allows us to
bound the quantity of interest, say $\|F_n - F\|_{q(F)}$, $\sqrt{n}\,\|F_n - F\|_{q(F)}$, etc., by
a quantity, $\|H_n - H\|_{q(H)}$, $\sqrt{n}\,\|H_n - H\|_{q(H)}$, etc., which satisfies the condition in
question.
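
The identity $F_n(x) = H_n(F(x))$ used in the proof of Lemma 8 can be checked on a toy case. The Python sketch below is only an illustration: the three-point distribution F, the fixed numbers standing in for uniform draws, and the helper names are all hypothetical. It uses the quantile function $F^{-1}(u) = \inf\{x: F(x) \ge u\}$ of Section 1.

```python
# Hypothetical discrete F on {0, 1, 2} with probabilities 0.3, 0.4, 0.3.
support = [0, 1, 2]
cdf = [0.3, 0.7, 1.0]            # F(0), F(1), F(2)


def F(x):
    """d.f. F evaluated at a support point."""
    return cdf[support.index(x)]


def F_inv(u):
    """F^{-1}(u) = inf{x : F(x) >= u}."""
    return next(x for x, p in zip(support, cdf) if p >= u)


U = [0.10, 0.35, 0.60, 0.90]     # stand-ins for uniform (0,1) draws
X = [F_inv(u) for u in U]        # X_i = F^{-1}(U_i)
n = len(U)


def F_n(x):
    """Sample d.f. of the X_i."""
    return sum(xi <= x for xi in X) / n


def H_n(t):
    """Sample d.f. of the U_i."""
    return sum(ui <= t for ui in U) / n


for x in support:
    print(x, F_n(x), H_n(F(x)))
```

The two columns agree at every support point because $F^{-1}(u) \le x$ holds exactly when $u \le F(x)$; this is the step that lets results for uniform samples be transferred to arbitrary F.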

We are now ready to use recent results relating to (2.6)-(2.8). Lemmas
9, 10, and 11 are taken from Wellner (1977a), O'Reilly (1974), and James (1975)
respectively. We attempt to preserve each author's notation. Each lemma is
followed by a corollary which provides the proper extension for our application.

Let $\mathcal{H}(+)$ denote the set of all nonnegative, nondecreasing, continuous
functions on [0,1] for which $\int_0^1 (1/h(t))\,dt < \infty$. Let $\mathcal{H}$ denote the set of all h
such that $h(t) = h(1-t) = \bar{h}(t)$ for $0 \le t \le 1/2$ and some $\bar{h}$ in $\mathcal{H}(+)$.

LEMMA 9. (Wellner). Let $\{U_i\}$ be a sequence of independent uniform (0,1)
r.v.'s. Let $h \in \mathcal{H}$. Then

(2.18)   $\lim_{n\to\infty} \sup_{0 \le t \le 1} \left| \dfrac{F_n(t) - t}{h(t)} \right| = 0$ w.p.1.

It is apparent that the conclusion of Lemma 9 can be extended to $Q_1 = \{q$: q is
bounded on [0,1] and $q(t) \ge h(t)\ \forall t \in [0,1]$, some $h \in \mathcal{H}\}$. The following
corollary follows immediately from Lemmas 8 and 9.

COROLLARY. Let $\{X_i\}$ be a sequence of independent r.v.'s having distribution
F (not necessarily continuous). Let $q \in Q_1$. Then

(2.19)   $\lim_{n\to\infty} \|F_n - F\|_{q(F)} = 0$ w.p.1.

Characterization of condition (2.7) for $\|\cdot\| = \|\cdot\|_{q(F)}$ is essentially
provided by the following weak convergence result, found as Theorem 2 in O'Reilly
(1974). Let D[0,1] be the space of real-valued functions on [0,1] having only
jump discontinuities. Let $\rho_q(x,y) = \sup_{0 \le t \le 1} |(x(t) - y(t))/q(t)|$. Since $(D, \rho_q)$
is not separable, O'Reilly uses "weak convergence" in the sense of Definition
2.1 of Pyke and Shorack (1968). Let $W^0(t)$ be a "tied-down" Wiener process (or
"Brownian bridge").

LEMMA 10. (O'Reilly). Let q be a continuous, nonnegative function on
[0,1], bounded away from zero on $[\alpha, 1-\alpha]$ for some $0 < \alpha < 1/2$, nondecreasing
(nonincreasing) on $[0, \alpha]$ ($[1-\alpha, 1]$). Let $\{U_i\}$ be a sequence of independent uniform
[0,1] r.v.'s, and define $U_n(t) = \sqrt{n}\,(F_n(t) - t)$. Then


(2.20) 


for all $\epsilon > 0$, $i = 1, 2$,

is both a necessary and sufficient condition for the weak convergence of $U_n$
to $W^0$ in $(D, \rho_q)$, where $h_1(t) = t^{-1/2} q(t)$ and $h_2(t) = t^{-1/2} q(1-t)$.
Let $Q_2^*$ be the set of q functions satisfying the conditions of Lemma 10
including (2.20). Let $Q_2 = \{q$: q is bounded on [0,1] and $q(t) \ge
q^*(t)\ \forall t \in [0,1]$, some $q^*(t) \in Q_2^*\}$. The following corollary follows from the
proof of O'Reilly's Theorem 2.

COROLLARY. Let $\{X_i\}$ be a sequence of independent r.v.'s having distribution
F (not necessarily continuous). Let $q \in Q_2$. Then

(2.21)   $\sqrt{n}\,\|F_n - F\|_{q(F)} = O_p(1)$ as $n \to \infty$.

PROOF. Following the proof of O'Reilly's Theorem 2, let $\tilde{U}_n$ and $\tilde{W}^0$ be
versions of $U_n$ and $W^0$ defined on a common probability space such that
$\sup_{0 \le t \le 1} |\tilde{U}_n(t) - \tilde{W}^0(t)| \to 0$. O'Reilly shows that

    $\sup_{0 \le t \le 1} \left| \dfrac{\tilde{U}_n(t) - \tilde{W}^0(t)}{q(t)} \right| \xrightarrow{p} 0$ as $n \to \infty$.

Thus

    $\sup_{0 \le t \le 1} \left| \dfrac{U_n(t)}{q(t)} \right| \xrightarrow{d} \sup_{0 \le t \le 1} \left| \dfrac{W^0(t)}{q(t)} \right|$ as $n \to \infty$.

Moreover, $\sup_{0 \le t \le 1} |W^0(t)/q(t)|$ has a (finite) distribution and by the bound
given in Lemma 8, (2.21) follows. □

The next result, due to James (1975), characterizes (2.8) for $\|\cdot\| = \|\cdot\|_{q(F)}$.
In accord with James' notation, we let W be the set of positive real-valued
functions  w on  [0,1]  such  that  for  some  0<6Sj,  t^w(t)  is  monotone  increasing  on 
(0,6],  (l-t)^w(t)  is  monotone  decreasing  on  [1-6,1),  and  w is  bounded,  on- [6-;I^6] . 

Here w plays the role of 1/q and values at 0 and 1 are arbitrary.

LEMMA 11. (James). Let $U_n(t)$ be defined as in Lemma 10. Let $w \in W$. If

(2.22)   $\int_0^1 \dfrac{w^2(t)}{\log\log\left(\dfrac{1}{t(1-t)}\right)}\,dt < \infty$,

then

(2.23)   $\limsup_{n\to\infty} \sup_{0 \le t \le 1} \dfrac{|U_n(t) w(t)|}{\sqrt{2 \log\log n}} = \sup_{0 \le t \le 1} [t(1-t)]^{1/2} w(t)$ w.p.1.

Conversely, if (2.22) diverges, then the l.h.s. of (2.23) is $+\infty$ w.p.1.

Let $Q_3^* = \{q$: q = 1/w, $w \in W$ and w satisfies (2.22)$\}$ and $Q_3 = \{q$: q is bounded on [0,1]
and $q(t) \ge q^*(t)\ \forall t \in (0,1)$, some $q^*(t) \in Q_3^*\}$. The following corollary is immediate
from Lemmas 8 and 11.

COROLLARY. Let $\{X_i\}$ be a sequence of independent r.v.'s having distribution F
(not necessarily continuous). Let $q \in Q_3$. Then there exists a constant $M < \infty$ such that

(2.24)   $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,\|F_n - F\|_{q(F)}}{\sqrt{\log\log n}} \le M$ w.p.1.


EXAMPLE. For any $0 < \delta_1 < 1/2$,

    $q(t) = (t(1-t))^{1/2 - \delta_1}$

belongs to both $Q_2$ and $Q_3$ (and q(t) belongs to $Q_1$). Gaenssler and Stute (1976)
note that for $q(t) = (t(1-t))^{1/2}$ and $U_n(t)$ defined as in Lemma 10, we have

(2.25)   $\sup_{0 < t < 1} \dfrac{|U_n(t)|}{(t(1-t))^{1/2}} \xrightarrow{p} \infty$ as $n \to \infty$

and

(2.26)   $\sup_{0 < t < 1} \dfrac{|U_n(t)|}{\sqrt{2t(1-t) \log\log n}} \xrightarrow{p} 1$ as $n \to \infty$.

However, the last statement of Lemma 11 tells us that the lim sup of the l.h.s. of (2.26) is
$+\infty$ w.p.1. Thus a "weak" form of (2.8) holds even though (2.7) and (2.8) do
not hold. □

We conclude this section with three theorems which combine the norm theory
just discussed with Lemmas 3-5.

THEOREM 1. Suppose that T is continuous at F w.r.t. $\|\cdot\|_\infty$ (or w.r.t.
$\|\cdot\|_{q(F)}$, $q \in Q_1$) and $\mathcal{G}_F$. If (2.5) and (2.10) hold, then

(2.27)   $\lim_{n\to\infty} T(F_n) = T(F)$ w.p.1.

THEOREM 2. Suppose that T has a differential at F w.r.t. $\|\cdot\|_\infty$
(or w.r.t. $\|\cdot\|_{q(F)}$, $q \in Q_2$) and $\mathcal{G}_F$. If (2.5), (2.9), and (2.10) hold, then

(2.28)   $\sqrt{n}\,(T(F_n) - T(F)) \xrightarrow{d} N(0, \sigma^2)$ as $n \to \infty$.

THEOREM 3. Suppose that T has a differential at F w.r.t. $\|\cdot\|_\infty$ (or
w.r.t. $\|\cdot\|_{q(F)}$, $q \in Q_3$) and $\mathcal{G}_F$. If (2.5), (2.9), and (2.10) hold, then

(2.29)   $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,(T(F_n) - T(F))}{\sqrt{2\sigma^2 \log\log n}} = 1$ w.p.1.


3.  FURTHER  PRELIMINARIES 

Let J be integrable on [0,1]. Then $K(t) = \int_0^t J(u)\,du$ is absolutely continuous
and $K'(t) = J(t)$ wherever J(t) is continuous. For any d.f. F, K(F(x)) is a
right continuous function of bounded variation. If J is nonnegative and
$\int_0^1 J(u)\,du = 1$, then K(t) and K(F(x)) are d.f.'s.

The following lemmas will be needed in the proofs of Theorems 4-7. Lemmas
12 and 13 allow the difference, T(F) - T(G), to be written in a convenient form.
Lemma 14 establishes a simple inequality. Let $\mathcal{J}_1$ be the set of all J integrable
on [0,1]. For $J \in \mathcal{J}_1$ let $\mathcal{F}_J = \{F: |\int F^{-1}(t) J(t)\,dt| < \infty\}$.

LEMMA 12. If $F \in \mathcal{F}_J$, then

(3.1)    $\int_0^1 F^{-1}(t)\,dK(t) = \int_{-\infty}^{\infty} x\,dK(F(x))$.

PROOF. Basically we need to justify the substitution t = F(x). Let $I_i = (a_i, b_i]$
be the intervals in [0,1] such that $F^{-1}(a_i, b_i] = x_i$, where $x_i$ is a jump point of
F and $F(x_i) = b_i$ and $F(x_i-) = a_i$. Then

(3.2)    $\int_0^1 F^{-1}(t)\,dK(t) = \sum_{i=1}^{\infty} \int_{I_i} F^{-1}(t)\,dK(t) + \int_{[0,1] - \cup I_i} F^{-1}(t)\,dK(t)$.

Let $S_F$ be the support of F and $A = \{x_1, x_2, \dots\}$ the set of jump points of F. Then

(3.3)    $\int_{-\infty}^{\infty} x\,dK(F(x)) = \int_{S_F} x\,dK(F(x)) = \sum_{i=1}^{\infty} \int_{\{x_i\}} x\,dK(F(x)) + \int_{S_F - A} x\,dK(F(x))$.

We first show that

(3.4)    $\sum_{i=1}^{\infty} \int_{I_i} F^{-1}(t)\,dK(t) = \sum_{i=1}^{\infty} \int_{\{x_i\}} x\,dK(F(x))$.

Consider the i-th integral of the l.h.s. of (3.4). We have

    $\int_{I_i} F^{-1}(t)\,dK(t) = x_i \int_{(a_i, b_i]} dK(t) = x_i [K(b_i) - K(a_i)]$,

since $F^{-1}(t) = x_i$ for $t \in (a_i, b_i]$. The i-th integral of the r.h.s. of (3.4) is

    $\int_{\{x_i\}} x\,dK(F(x)) = x_i [K(F(x_i)) - K(F(x_i-))] = x_i [K(b_i) - K(a_i)]$

by definition of the Stieltjes integral and by substitution of $F(x_i) = b_i$,
$F(x_i-) = a_i$. Thus we have term by term equality in (3.4). Now to justify

(3.5)    $\int_{[0,1] - \cup I_i} F^{-1}(t)\,dK(t) = \int_{S_F - A} x\,dK(F(x))$

we need only note that t = F(x) is a one-to-one mapping of $S_F - A$ onto $[0,1] - \cup I_i$
and apply a general change of variable lemma (e.g., Dunford and Schwartz, Vol 1,
p. 182). Equalities (3.2), (3.3), (3.4), and (3.5) give (3.1). □

Related to the preceding lemma is the familiar identity,

    $\int_0^1 F^{-1}(t) J(t)\,dt = \int_{-\infty}^{\infty} x J(F(x))\,dF(x)$,

which is valid for all continuous F, but not, however, for all discontinuous F.

LEMMA 13. If $F_1, F_2 \in \mathcal{F}_J$, then

(3.6)    $\int_{-\infty}^{\infty} [K(F_1(x)) - K(F_2(x))]\,dx = \int_{-\infty}^{\infty} x\,dK(F_2(x)) - \int_{-\infty}^{\infty} x\,dK(F_1(x))$.

PROOF. This is a trivial application of integration by parts. The
result is well-known when K(t) is a d.f. (e.g., Rao (1973), p. 95 (correcting
a misprint)). □

Combining Lemmas 12 and 13, we obtain

(3.7)    $T(F) - T(G) = \int_{-\infty}^{\infty} [K(G(x)) - K(F(x))]\,dx$.

The next result relates to the quantity $V_{G,F}$ defined for d.f.'s G and
F by

    $V_{G,F} = \sup_{G(x) - F(x) \ne 0} \left| \dfrac{V_{G,F}(x)}{G(x) - F(x)} \right|$,

where

    $V_{G,F}(x) = K(G(x)) - K(F(x)) - (G(x) - F(x)) J(F(x))$.

LEMMA 14. Suppose that $J \in \mathcal{J}_1$ is bounded. Then

    $V_{G,F} \le 2\|J\|_\infty$.

PROOF.

    $|K(G(x)) - K(F(x))| = \left| \int_{F(x)}^{G(x)} J(u)\,du \right| \le |G(x) - F(x)|\,\|J\|_\infty$.

Thus, for $G(x) \ne F(x)$,

    $\dfrac{|V_{G,F}(x)|}{|G(x) - F(x)|} \le \dfrac{|G(x) - F(x)|\,\|J\|_\infty + |G(x) - F(x)|\,|J(F(x))|}{|G(x) - F(x)|} \le 2\|J\|_\infty$. □
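
The representation (3.7) and the bound of Lemma 14 can be checked numerically in a simple case. The following Python sketch is an illustration under stated assumptions, not a construction from the paper: F and G are taken to be the uniform d.f.'s on (0,1) and (0,2), the score is the smooth choice J(u) = 6u(1-u) (so that K(t) = 3t^2 - 2t^3 and the sup-norm of J is 1.5), and all integrals are approximated by the midpoint rule.

```python
J = lambda u: 6.0 * u * (1.0 - u)            # bounded score (assumed), sup-norm 1.5
K = lambda t: 3.0 * t ** 2 - 2.0 * t ** 3    # K(t) = int_0^t J(u) du
J_SUP = 1.5

F = lambda x: min(max(x, 0.0), 1.0)          # uniform (0,1) d.f.
G = lambda x: min(max(x / 2.0, 0.0), 1.0)    # uniform (0,2) d.f.


def T(quantile, m=20000):
    """T(F) = int_0^1 F^{-1}(t) J(t) dt, approximated by the midpoint rule."""
    return sum(quantile((k + 0.5) / m) * J((k + 0.5) / m) for k in range(m)) / m


T_F = T(lambda t: t)          # F^{-1}(t) = t
T_G = T(lambda t: 2.0 * t)    # G^{-1}(t) = 2t

m = 20000
xs = [2.0 * (k + 0.5) / m for k in range(m)]              # midpoints of (0, 2)
rhs = sum(K(G(x)) - K(F(x)) for x in xs) * (2.0 / m)      # r.h.s. of (3.7)


def V(x):
    """Pointwise remainder K(G(x)) - K(F(x)) - (G(x) - F(x)) J(F(x))."""
    return K(G(x)) - K(F(x)) - (G(x) - F(x)) * J(F(x))


bound_ok = all(abs(V(x)) <= 2.0 * J_SUP * abs(G(x) - F(x)) + 1e-12 for x in xs)

print(T_F - T_G, rhs, bound_ok)
```

Here T(F) = 1/2 and T(G) = 1, so both sides of (3.7) are close to -1/2, and the pointwise remainder stays within the Lemma 14 bound on the whole grid.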

4 . ROBUST  L-FUNCTIONALS 

In  this  section  we  restrict  J to  be  bounded,  continuous  a.e.  Lebesgue  and 
a.e.  F \ and  trirmed  so  that  J(u)  * 0 near  0 and  1.  Specifically  this 
trimming  is  of  the  form 


J(u)  = 0 u e [G,t^)  u (t^.l] 

for  0<tj<t2<l.  As  noted  in  Section  1,  the  functionals  generated  by  these  special 
J functions  are  generally  viewed  as  robust  (for  justification,  see  Bickel  and 
Lehmann  (1975),  I.,  p.  1054).  itoreover,  by  separating  this  subclass  from 
the  more  general  functionals  of  the  next  section,  we  will  be  able  to  prove 
the  existence  of  the  differential  under  a minimum  of  conditions  and  with 
respect  to  the  simple  sup-norm,  I|Fjj-F||^  » sup|F^(x)-F(x)  | . 
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Let  F denote  a fixed  underlying  d.f.,  and  define  t^  and  t2  as  above. 

Note  that  we  may  take  F=Gp  = {all  d.f.'s}. 

THEOREM  4.  Suppose  that 

(4.1)  J is  hounded  and  continuous  a.e.  Lebeegue  and  a.e.  F"^; 

(4.2)  J(u)  = 0 for  u e [O.t^)  u (t2,l]. 

Then^  for  T(F)  = /f~^ (t)J (t)dt , the  differential  of  at  F w.r.t.  M'll^ 
is  given  by 

(4.3)  T(F;  A)  = -/A(x)J(F(x))dx. 

REMARKS. (i) Since $T(F;\Delta)$ is clearly linear in $\Delta$, the conclusion may
be restated via (2.2) as

(4.4)  $T(G) - T(F) - \int (F(x) - G(x))\,J(F(x))\,dx = o(\|G-F\|_\infty)$ as $\|G-F\|_\infty \to 0$.

(ii) If F has bounded support, then (4.2) is not required.

(iii) The "a.e. $F^{-1}$" statement in (4.1) guarantees that J is continuous
where F is flat (points F(x) such that x is not in the support of F). When
J is continuous except at the trimming points $t_1$ and $t_2$, this requirement
reduces to the assumption that $t_1$ and $t_2$ correspond to unique quantiles of F. □
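To see what (4.4) asserts in a concrete case, the following sketch evaluates the remainder $-\int V_{G,F}(x)\,dx$ numerically and divides it by $\|G-F\|_\infty$. Everything specific here is an assumption chosen for illustration only: F is taken to be the Uniform(0,1) d.f., the perturbed d.f. is the hypothetical family $G_\varepsilon(x) = x + \varepsilon x(1-x)$ on [0,1], and J is the trimmed-mean score with trimming points 0.25 and 0.75.

```python
def K(t, t1=0.25, t2=0.75):
    """K(t) = integral of the trimmed-mean score J over [0, t]."""
    return min(max((t - t1) / (t2 - t1), 0.0), 1.0)

def J(u, t1=0.25, t2=0.75):
    """Trimmed-mean score function (illustrative choice)."""
    return 1.0 / (t2 - t1) if t1 < u < t2 else 0.0

def F(x):
    """Uniform(0,1) distribution function (assumed for the illustration)."""
    return min(max(x, 0.0), 1.0)

def make_G(eps):
    """Hypothetical perturbed d.f. G(x) = x + eps*x*(1-x) on [0,1], valid for 0 <= eps <= 1."""
    def G(x):
        x = min(max(x, 0.0), 1.0)
        return x + eps * x * (1.0 - x)
    return G

def remainder(G, n=20000):
    """Midpoint-rule value of -integral of V_{G,F}(x) dx; V vanishes outside [0,1] here."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * h
        f, g = F(x), G(x)
        total += K(g) - K(f) - (g - f) * J(f)
    return -total * h

ratios = []
for eps in (0.4, 0.2, 0.1):
    sup_norm = eps / 4.0  # sup over [0,1] of eps*x*(1-x)
    ratios.append(abs(remainder(make_G(eps))) / sup_norm)
print(ratios)  # the ratios should shrink as eps decreases
```

A shrinking ratio along one family of perturbations is consistent with (4.4) but does not establish it; the theorem requires the ratio to vanish uniformly over all G approaching F in sup-norm.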

PROOF OF THEOREM 4. Since $T(F;\Delta)$ is linear, we need only show (4.4).

By (3.7) the l.h.s. of (4.4) can be written as

$$-\int [K(G(x)) - K(F(x)) - (G(x)-F(x))\,J(F(x))]\,dx = -\int V_{G,F}(x)\,dx.$$

Let (a,b) be such that $G(a) < t_1$, $F(a) < t_1$, $G(b) > t_2$, $F(b) > t_2$. Then
$J(F(x))$ and $K(G(x)) - K(F(x))$ are 0 outside (a,b), so $V_{G,F}(x) = 0$ outside (a,b).
Let $B = \{x : F(x) \text{ is a discontinuity point of } J\}$ and define

$$W_{G,F}(x) = \begin{cases} \dfrac{V_{G,F}(x)}{G(x)-F(x)} & \text{if } G(x) \ne F(x), \\[1ex] 0 & \text{if } G(x) = F(x). \end{cases}$$


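Since B is a Lebesgue-null set (using the fact that J is continuous a.e.
$F^{-1}$), we have

$$\left|\int V_{G,F}(x)\,dx\right| = \left|\int_{(a,b)-B} V_{G,F}(x)\,dx\right| = \left|\int_{(a,b)-B} (G(x)-F(x))\,W_{G,F}(x)\,dx\right| \le \|G-F\|_\infty \int_{(a,b)-B} |W_{G,F}(x)|\,dx.$$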


By definition of the derivative,

$$\lim_{\|G-F\|_\infty \to 0} W_{G,F}(x) = 0 \qquad \forall\, x \in (a,b) - B,$$

because $K'(F(x)) = J(F(x))$ on $(a,b) - B$. Since $|W_{G,F}(x)| \le 2\|J\|_\infty$ by Lemma
14, we can justify interchange of the operations of limit and integration
through use of the theorem on bounded convergence for a finite interval. That is,

$$\lim_{\|G-F\|_\infty \to 0} \frac{\left|\int V_{G,F}(x)\,dx\right|}{\|G-F\|_\infty} \le \lim_{\|G-F\|_\infty \to 0} \int_{(a,b)-B} |W_{G,F}(x)|\,dx = \int_{(a,b)-B} \lim_{\|G-F\|_\infty \to 0} |W_{G,F}(x)|\,dx = 0. \qquad \square$$


Asymptotic normality and an LIL follow easily from Theorems 2, 3, and 4.
Note that for the differential defined in Theorem 4, we have

(4.5)  $T[F;x] = T(F;\delta_x - F) = \int [F(t) - I(x \le t)]\,J(F(t))\,dt.$

Thus

(4.6)  $E_F\{T[F;X]\} = 0,$

and

(4.7)  $\sigma^2 = \mathrm{Var}_F\{T[F;X]\} = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} J(F(s))\,J(F(t))\,[F(\min(s,t)) - F(s)F(t)]\,ds\,dt.$
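The double integral in (4.7) is easy to evaluate numerically for a specific pair (J, F). The sketch below is an illustration under assumptions not taken from the text: F is the Uniform(0,1) d.f. (so F(s) = s on [0,1]) and J is the trimmed-mean score with trimming points 0.25 and 0.75. For this choice a direct hand calculation of the integral gives 1/6, which the midpoint rule should reproduce closely.

```python
def J(u, t1=0.25, t2=0.75):
    """Trimmed-mean score function (illustrative choice)."""
    return 1.0 / (t2 - t1) if t1 < u < t2 else 0.0

def sigma2_uniform(n=400, t1=0.25, t2=0.75):
    """Midpoint-rule value of (4.7) when F is Uniform(0,1), i.e. F(s) = s on [0,1]."""
    h = 1.0 / n
    pts = [(i + 0.5) * h for i in range(n)]
    total = 0.0
    for s in pts:
        js = J(s, t1, t2)
        if js == 0.0:
            continue  # trimmed region contributes nothing
        for t in pts:
            total += js * J(t, t1, t2) * (min(s, t) - s * t)
    return total * h * h

sigma2 = sigma2_uniform()
print(sigma2)  # close to 1/6 for this particular J and F
```

The same routine works for other bounded J; only the line defining F(s) = s would need to change for a different underlying distribution.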

COROLLARY. Suppose that J and F satisfy (4.1) and (4.2) and that $\sigma^2 > 0$.
Let $\{X_i\}$ be a sequence of independent r.v.'s having distribution F. Then

(4.8)  $\sqrt{n}\,(T(F_n)-T(F)) \xrightarrow{d} N(0,\sigma^2)$ as $n \to \infty$,

and

(4.9)  $\limsup_{n\to\infty} \dfrac{\sqrt{n}\,(T(F_n)-T(F))}{\sqrt{2\sigma^2 \log\log n}} = 1$ w.p.1.

PROOF. Condition (2.9) is given by (4.6) and the assumption that $\sigma^2 > 0$.
Condition (2.5) is satisfied since $\mathcal{G}_F$ in this case is the set of all d.f.'s.
Condition (2.10) holds by hypothesis and Theorem 4 provides the existence of
the differential. Thus the conditions of Theorems 2 and 3 are satisfied. □

We could formulate the above corollary for dependent variables also.
Specifically, for (4.8), we could replace the independence assumption by the
assumption that the $\{X_i\}$ satisfy (2.11) and (2.7) with $\|\cdot\| = \|\cdot\|_\infty$. For
(4.9), we would need (2.12) and (2.8) with $\|\cdot\| = \|\cdot\|_\infty$.
Note that (4.9) yields strong consistency of $T(F_n)$. However, by appeal
to continuity of T, we can slightly relax (4.1).

THEOREM 5. Suppose that J is integrable on [0,1] and bounded and that
(4.2) holds. Then T is continuous at F w.r.t. $\|\cdot\|_\infty$. Further, if $\{X_i\}$ is
a sequence of independent r.v.'s having distribution F, then

(4.10)  $\lim_{n\to\infty} T(F_n) = T(F)$ w.p.1.

PROOF. Let (a,b) be as in the proof of Theorem 4. Then

$$|T(G)-T(F)| = \left|\int [K(G(x)) - K(F(x))]\,dx\right| \le |b-a|\,\|J\|_\infty\,\|G-F\|_\infty.$$

Thus T is continuous at F w.r.t. $\|\cdot\|_\infty$, and an appeal to Theorem 1 yields
(4.10). □


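EXAMPLES. (i) The trimmed mean.

$$J(t) = \begin{cases} \dfrac{1}{\alpha_2-\alpha_1}, & \alpha_1 < t < \alpha_2, \\[1ex] 0, & t < \alpha_1 \text{ or } t > \alpha_2. \end{cases}$$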


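(ii) The smoothly trimmed mean (from Stigler (1973)). Let $\alpha_1 = 1-\alpha_2 = \alpha$
and c be a constant.

$$J(t) = \begin{cases} \dfrac{2c}{\alpha}\left(t - \dfrac{\alpha}{2}\right), & \dfrac{\alpha}{2} \le t < \alpha, \\[1ex] c, & \alpha \le t \le 1-\alpha, \\[1ex] \dfrac{2c}{\alpha}\left(1 - \dfrac{\alpha}{2} - t\right), & 1-\alpha < t \le 1-\dfrac{\alpha}{2}, \\[1ex] 0, & 0 \le t < \dfrac{\alpha}{2} \text{ or } 1-\dfrac{\alpha}{2} < t \le 1. \end{cases}$$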


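(iii) Consider the interquartile range, $\tfrac12[F^{-1}(3/4) - F^{-1}(1/4)]$. This functional
is obtained from (1.1) by letting K(t) place mass $\tfrac12$ at $F^{-1}(3/4)$ and $-\tfrac12$ at
$F^{-1}(1/4)$. Although Theorem 4 doesn't apply directly to discrete K(t), we can
approximate $\tfrac12[F^{-1}(3/4) - F^{-1}(1/4)]$ by a functional with absolutely continuous K(t):

$$J(t) = \begin{cases} -\dfrac{t}{2\delta^2} + \dfrac{1-4\delta}{8\delta^2}, & \tfrac14 - \delta \le t \le \tfrac14, \\[1ex] \dfrac{t}{2\delta^2} - \dfrac{1+4\delta}{8\delta^2}, & \tfrac14 \le t \le \tfrac14 + \delta, \\[1ex] \dfrac{t}{2\delta^2} - \dfrac{3-4\delta}{8\delta^2}, & \tfrac34 - \delta \le t \le \tfrac34, \\[1ex] -\dfrac{t}{2\delta^2} + \dfrac{3+4\delta}{8\delta^2}, & \tfrac34 \le t \le \tfrac34 + \delta, \\[1ex] 0, & \text{otherwise.} \end{cases}$$

The area within each spike is $\tfrac12$. Note that for this functional as well as for
the smoothly trimmed mean, J is continuous everywhere. Thus Theorem 4 applies
to these functionals for any d.f. F.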
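The examples above can be made concrete by evaluating $T(F_n)$ for a sample d.f. $F_n$. Since $F_n^{-1}(t)$ equals the i-th order statistic on $((i-1)/n, i/n]$, the functional reduces to a weighted sum of order statistics with weights $K(i/n) - K((i-1)/n)$. The sketch below is an illustration only: the function names, the trimming points 0.25 and 0.75, the spike half-width 0.05, and both data sets are assumptions made here, not part of the text.

```python
def l_statistic(sample, K):
    """T(F_n) = sum_i X_(i) * [K(i/n) - K((i-1)/n)] for the sample d.f. F_n."""
    xs = sorted(sample)
    n = len(xs)
    return sum(x * (K((i + 1) / n) - K(i / n)) for i, x in enumerate(xs))

def K_trimmed(t, a1=0.25, a2=0.75):
    """K for the trimmed mean: integral of 1/(a2 - a1) over (a1, a2)."""
    return min(max((t - a1) / (a2 - a1), 0.0), 1.0)

def tri_cdf(t, center, delta):
    """Integral up to t of a unit-area triangular spike on [center - delta, center + delta]."""
    u = (t - center) / delta
    if u <= -1.0:
        return 0.0
    if u >= 1.0:
        return 1.0
    return 0.5 * (1.0 + u) ** 2 if u <= 0.0 else 1.0 - 0.5 * (1.0 - u) ** 2

def K_spikes(t, delta=0.05):
    """K for example (iii): a spike of area -1/2 at 1/4 and a spike of area +1/2 at 3/4."""
    return 0.5 * (tri_cdf(t, 0.75, delta) - tri_cdf(t, 0.25, delta))

# Trimmed mean of eight points: the two smallest and two largest get zero weight.
trimmed = l_statistic([10, 1, 3, 100, 5, 4, 6, 2], K_trimmed)

# Smoothed version of (1/2)[F^{-1}(3/4) - F^{-1}(1/4)] on the sample 1, 2, ..., 100.
half_iqr = l_statistic(range(1, 101), K_spikes)
print(trimmed, half_iqr)
```

Working with K rather than J avoids numerical integration of the score function: each weight is an exact increment of K, so the weights sum to K(1) - K(0) automatically.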

5.  GENERAL  L-FUNCTIONALS 

In  this  section  we  remove  the  trimming  restriction  on  J.  In  order  to 
deal  with  the  weight  placed  on  the  extremes  of  F,  we  will  use  the  q-norms 
introduced  in  Section  2.  Let 

$\mathcal{F} = \{F : |\int F^{-1}(t)J(t)\,dt| < \infty\}$ and $\mathcal{G}_F = \{G : G \in \mathcal{F} \text{ and } S_G \subset S_F\}$,

where $S_F$ is the support of F. Let q be a bounded positive function on (0,1).

THEOREM 6. Suppose that J, q, and $F \in \mathcal{F}$ satisfy (4.1) and

(5.1)  $\int q(F(x))\,dx < \infty.$

Then the differential of $T(F) = \int_0^1 F^{-1}(t)J(t)\,dt$ at F w.r.t. $\|\cdot\|_{q(F)}$ and $\mathcal{G}_F$
is given by (4.3).

REMARKS. Condition (5.1) governs the tails of F. For $q(t) = [t(1-t)]^{\frac12-\delta}$,
(5.1) becomes

(5.2)  $\int (F(x)(1-F(x)))^{\frac12-\delta}\,dx < \infty.$

Stigler (1974, p. 685) notes that $\int (F(x)(1-F(x)))^{\frac12}\,dx < \infty$ is almost the same as
the existence of a finite 2nd moment. □

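PROOF OF THEOREM 6. We must show $\int V_{G,F}(x)\,dx = o(\|G-F\|_{q(F)})$ as
$\|G-F\|_{q(F)} \to 0$, $G \in \mathcal{G}_F$. Define B as in the proof of Theorem 4 and let the closure of
$(x_1,x_2)$ be the smallest interval (possibly infinite) containing $S_F$. Then
for $G \in \mathcal{G}_F$ we have

$$\left|\int V_{G,F}(x)\,dx\right| = \left|\int_{(x_1,x_2)-B} V_{G,F}(x)\,dx\right| = \left|\int_{(x_1,x_2)-B} \frac{G(x)-F(x)}{q(F(x))}\,W_{G,F}(x)\,q(F(x))\,dx\right| \le \|G-F\|_{q(F)} \int_{(x_1,x_2)-B} |W_{G,F}(x)|\,q(F(x))\,dx.$$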


Once again we have only to show that the interchange of limit and
integration is valid, since

$$\lim_{\substack{\|G-F\|_{q(F)} \to 0 \\ G \in \mathcal{G}_F}} |W_{G,F}(x)| = 0 \qquad \forall\, x \in (x_1,x_2) - B.$$

(Recall that for $G \in \mathcal{G}_F$, $|G(x)-F(x)| \le \|G-F\|_\infty \le \|q\|_\infty\,\|G-F\|_{q(F)}$ for all x.) In this
case we appeal to dominated convergence by way of Lemma 14. That is, by
Lemma 14,

$$|W_{G,F}(x)|\,q(F(x)) \le 2\|J\|_\infty\,q(F(x)),$$

and the r.h.s. is integrable by (5.1). Thus for $G \in \mathcal{G}_F$ we have
$$\lim_{\|G-F\|_{q(F)} \to 0} \frac{\left|\int V_{G,F}(x)\,dx\right|}{\|G-F\|_{q(F)}} \le \lim_{\|G-F\|_{q(F)} \to 0} \int_{(x_1,x_2)-B} |W_{G,F}(x)|\,q(F(x))\,dx = \int_{(x_1,x_2)-B} \lim_{\|G-F\|_{q(F)} \to 0} |W_{G,F}(x)|\,q(F(x))\,dx = 0. \qquad \square$$

Note  that  unbounded  J’s  could  be  allowed  if  the  above  interchange  of 
operations  could  be  justified  in  such  a situation.  Asymptotic  normality 
and  an  LIL  are  provided  by  the  following  corollary. 

COROLLARY. Suppose that $\sigma^2$ defined by (4.7) is finite and positive. Let
$\{X_i\}$ be a sequence of independent r.v.'s having distribution F. If $q \in Q_1$
and J and $F \in \mathcal{F}$ satisfy (4.1) and (5.1), then (4.8) holds. If $q \in Q_2$ and
J and $F \in \mathcal{F}$ satisfy (4.1) and (5.1), then (4.9) holds.

The  following  analogue  of  Theorem  5 gives  continuity  of  T and  strong 
consistency  of  T(F^). 

THEOREM 7. Let $q \in Q_1$. Suppose that J is integrable on [0,1] and
bounded and that (5.1) holds. Then T is continuous at F w.r.t. $\|\cdot\|_{q(F)}$.
Further, if $\{X_i\}$ is a sequence of independent r.v.'s having distribution F,
then (4.10) holds.

PROOF.

$$|T(G)-T(F)| = \left|\int [K(G(x))-K(F(x))]\,dx\right| \le \|J\|_\infty \int |G(x)-F(x)|\,dx \le \|J\|_\infty\,\|G-F\|_{q(F)} \int q(F(x))\,dx. \qquad \square$$

EXAMPLES. (i) The mean, $J(t) \equiv 1$. In this case Lemma 12 yields the
familiar identity

$$\int_0^1 F^{-1}(t)\,dt = \int_{-\infty}^{\infty} x\,dF(x).$$


(ii) Gini's mean difference, $J(t) = t - \tfrac12$.
(iii) A location estimator suggested by Bickel (1973), $J(t) = 6t(1-t)$.

6.  EXTENSION  TO  A LARGER  CLASS  OF  L-FUNCTIONALS 

Some authors consider the slightly more general statistic $\sum_{i=1}^n c_{in}\,h(X_{in})$
(e.g. Chernoff, Gastwirth, and Johns (1967)). Similarly, our functional is

(6.1)  $T(F) = \int_0^1 h(F^{-1}(t))\,dK(t).$

Since h(x) = x is commonly the function used in applications, attention has
been confined to this case in Sections 3-5. However, it is easy to extend
the results of those sections to T defined by (6.1). Let $\mathcal{H}$ be the set of
continuous functions h defined on $(-\infty,\infty)$ such that $h = h^+ - h^-$ for monotone
increasing functions $h^+$ and $h^-$. For $h \in \mathcal{H}$ let $\mathcal{F}_h = \{F : |\int h(F^{-1}(t))\,dK(t)| < \infty\}$.

First  we  give  without  proof  the  analogues  to  Lemmas  12  and  13. 

LEMMA 12*. If $F \in \mathcal{F}_h$, then

$$\int_0^1 h(F^{-1}(t))\,dK(t) = \int_{-\infty}^{\infty} h(x)\,dK(F(x)).$$

LEMMA 13*. If $F_1, F_2 \in \mathcal{F}_h$, then

$$\int (K(F_1(x))-K(F_2(x)))\,dh(x) = \int h(x)\,dK(F_2(x)) - \int h(x)\,dK(F_1(x)).$$

Let $\mu_h$ be the measure corresponding to $h^+ + h^-$ and let B be defined as
in the proof of Theorem 4. The following are analogues to Theorems 4 and 6.

We omit the proofs as well as the analogues to Theorems 5 and 7.

THEOREM 4*. Suppose that J, h, and $F \in \mathcal{F}_h$ satisfy (4.2) and

(6.2)  J is bounded and continuous a.e. Lebesgue and $\mu_h(B) = 0$.

Then T defined by (6.1) has a differential at F w.r.t. $\|\cdot\|_\infty$ given by

(6.3)  $T(F;\Delta) = -\int \Delta(x)\,J(F(x))\,dh(x).$

THEOREM 6*. Suppose that J, h, q, and $F \in \mathcal{F}_h$ satisfy (6.2) and

(6.4)  $\left|\int q(F(x))\,dh(x)\right| < \infty.$

Then T defined by (6.1) has a differential at F w.r.t. $\|\cdot\|_{q(F)}$ and
$\mathcal{G}_F = \{G : G \in \mathcal{F}_h \text{ and } S_G \subset S_F\}$ given by (6.3).

Bickel and Lehmann (1976, III) discuss the functional

(6.5)  $\tau(F) = \left[\int_0^1 (F_\mu^{-1}(t))^\alpha\,dK(t)\right]^{1/\alpha},$

where $F_\mu$ denotes the distribution of $|X-\mu|$ when X has distribution F, $\mu$ is
a fixed constant, $\alpha > 0$, and K(t) is a d.f. on (0,1). As a functional of
F, (6.5) is not amenable to our methods. However, if we let $\tau(F) = T(F_\mu)$,
a functional of $F_\mu$, it is easy to see that $[T(F_\mu)]^\alpha$ has the appropriate form
with $h(x) = x^\alpha$. Then, assuming the conditions of Theorem 4* (or Theorem 6*),
we have that the differential of $[T(\cdot)]^\alpha$ at $F_\mu$ w.r.t. $\|\cdot\|_\infty$ (or w.r.t. $\|\cdot\|_{q(F_\mu)}$
and $\mathcal{G}_{F_\mu}$) is given by

(6.6)  $T^\alpha(F_\mu;\Delta) = -\int \Delta(x)\,J(F_\mu(x))\,d(x^\alpha).$

Since we really want the differential of $T(\cdot)$ rather than the differential
of $[T(\cdot)]^\alpha$, the following "chain rule" is required.

LEMMA 15. Let f(x) be a real-valued function, $f: R \to R$, with a derivative
at $x = T(F)$, $f'(T(F))$. Suppose further that the functional $T(\cdot)$ has a
differential at F w.r.t. $\|\cdot\|$ and $\mathcal{G}_F$ given by $T(F;G-F)$, and that $T(G) - T(F)$ is
$O(\|G-F\|)$ as $\|G-F\| \to 0$. Then the composite functional $S(\cdot) = f(T(\cdot))$ has a
differential w.r.t. $\|\cdot\|$ and $\mathcal{G}_F$ given by

(6.7)  $S(F;\Delta) = f'(T(F))\,T(F;\Delta).$

PROOF. Trivial.

We apply this lemma with $f(x) = x^{1/\alpha}$ so that $f'(x) = x^{(1-\alpha)/\alpha}/\alpha$. Combining
(6.6) and (6.7), we obtain

$$T(F_\mu;G-F_\mu) = \frac{[T(F_\mu)]^{1-\alpha}}{\alpha} \int (F_\mu(x)-G(x))\,J(F_\mu(x))\,d(x^\alpha).$$

EXAMPLE. Let $(\mu,\sigma^2)$ be the mean and variance of F, $\alpha = 2$, and $K(t) = t$.
Then $T(F_\mu)$ is the standard deviation $\sigma$ of F, and the differential is

(6.8)  $T(F_\mu;G-F_\mu) = \dfrac{\int (F_\mu(x)-G(x))\,d(x^2)}{2T(F_\mu)} = \dfrac{\int x^2\,d(G(x)-F_\mu(x))}{2T(F_\mu)}.$

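For a sample $X_1,\dots,X_n$ of independent r.v.'s with distribution F, let
$Y_i = |X_i-\mu|$ and $F_{\mu n}$ be the sample d.f. formed from the $Y_i$'s. Then substitution
of $G = F_{\mu n}$ in (6.8) yields

$$T(F_\mu;F_{\mu n}-F_\mu) = \frac{1}{2\sigma}\,\frac{1}{n}\sum_{i=1}^n (Y_i^2-\sigma^2) = \frac{1}{n}\sum_{i=1}^n \left\{\frac{Y_i^2-\sigma^2}{2\sigma}\right\}.$$

Since $E(Y_i^2-\sigma^2) = 0$ and $E(Y_i^2-\sigma^2)^2 = E(X-\mu)^4 - \sigma^4$, the central limit theorem
yields (via Lemma 2) that

$$\sqrt{n}\,(T(F_{\mu n}) - \sigma) \xrightarrow{d} N\!\left(0,\ \frac{E(X-\mu)^4-\sigma^4}{4\sigma^2}\right)$$

as $n \to \infty$.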
This last result could have been anticipated since $T(F_{\mu n})$ turns out to be the
usual estimator of $\sigma$:

$$T(F_{\mu n}) = \left[\int_0^\infty x^2\,dF_{\mu n}(x)\right]^{1/2} = \left[\int_0^\infty x^2\,d\!\left(\frac{1}{n}\sum_{i=1}^n I(-x \le X_i-\mu \le x)\right)\right]^{1/2} = \left[\frac{1}{n}\sum_{i=1}^n (X_i-\mu)^2\right]^{1/2}.$$
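The standard-deviation example can be traced numerically. The sketch below is an illustration under assumptions made here only: the data are simulated standard normal values with a fixed seed, and $\mu$ and $\sigma$ are treated as known. It computes $T(F_{\mu n})$, the linear term obtained by substituting the sample d.f. into (6.8), and the remainder left after subtracting that term. Simple algebra shows the remainder equals $-(T(F_{\mu n})-\sigma)^2/(2\sigma)$, so it is of second order in the estimation error.

```python
import math
import random

random.seed(0)
mu, sigma = 0.0, 1.0   # assumed known mean and s.d. of F (standard normal, for illustration)
n = 2000
xs = [random.gauss(mu, sigma) for _ in range(n)]
ys = [abs(x - mu) for x in xs]   # Y_i = |X_i - mu|

# T(F_{mu n}): square root of the average squared deviation from mu.
T_n = math.sqrt(sum(y * y for y in ys) / n)

# Differential term: average of (Y_i^2 - sigma^2) / (2 sigma).
linear = sum((y * y - sigma ** 2) / (2 * sigma) for y in ys) / n

# What the linear approximation misses.
remainder = T_n - sigma - linear
print(T_n - sigma, linear, remainder)
```

Because the remainder is a squared error divided by a constant, it is negligible relative to the linear term for large n, which is the reason the central limit theorem for the linear term carries over to $T(F_{\mu n})$.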

7. CONCLUSIONS AND COMPARISONS

Stigler (1974) provides good motivation for the use of L-statistics
which are generated by smooth weight functions J. One inherent value of
these statistics is that the theorems justifying their use (asymptotic
normality, etc.) place the force of the restricting conditions on J (over
which we have control) rather than on F, which is often only partially known.
Thus, practicing statisticians can actually verify, rather than assume, most
of the needed hypotheses. From consideration of counterexamples, it appears
that J continuous a.e. $F^{-1}$ is essentially a necessary condition for asymptotic
normality (cf. Stigler (1974), Section 5.6). Requiring J to be bounded is
not very restrictive, since in most situations, unbounded J functions would
produce notoriously nonrobust estimators. Hence, J bounded and continuous
a.e. $F^{-1}$ is a natural restriction which we and Stigler (1974) have in common.
The additional requirement that J be continuous a.e. Lebesgue is necessary
for our Theorems 4 and 6, and Stigler needs it for several of his theorems
(cf. Stigler (1974), Theorems 3 and 4). It should be noted that Stigler's
statistic, defined by

$$S_n = n^{-1}\sum_{i=1}^n J\!\left(\frac{i}{n+1}\right) X_{in},$$

is somewhat different from our (1.3). Nevertheless, both converge to
$\int F^{-1}(t)J(t)\,dt$, and rough comparisons can be made. Basically, Stigler's
Theorems 2 and 5, which establish

(7.1)  $\dfrac{S_n - E(S_n)}{\sigma(S_n)} \xrightarrow{d} N(0,1)$ as $n \to \infty$,

require a little less than our Theorems 4 and 6. However, for practical use,
(7.1) is needed with $E(S_n)$ replaced by $\int F^{-1}(t)J(t)\,dt$ and $\sigma(S_n)$ replaced by
$\sigma$ defined by (4.7). To accomplish this, Stigler requires stronger assumptions
on J than our (4.1) (see Theorem 4, Stigler (1974)). On the other hand, the
tail condition (5.2) is not quite as mild as $\int [F(x)(1-F(x))]^{\frac12}\,dx < \infty$, the one
given in Stigler's Theorem 4. Stigler extends to certain independent but
non-identically distributed variables, whereas we can extend to identically
distributed but dependent variables. Of course, our method also establishes
strong consistency and an LIL without additional assumptions.

The results of Gregory (1976), like ours, are obtained by differential
methods. He proves Theorem 4* for J bounded and continuous and F absolutely
continuous. It appears that his use of the chain rules associated with
formal Fréchet differentiation requires J to be continuous everywhere. This
indicates the power of our more flexible version of the differential approach.
Weakening of the absolutely continuous assumption on F could be made via our
Lemma 8. Also, his special q function yields a little sharper asymptotic
normality result, though it doesn't appear to belong to $Q_1$ or $Q_2$.

Wellner  (1977a),  (1977b)  provides  somewhat  more  general  results  for 
strong  consistency  and  the  LIL.  In  particular,  his  LIL  is  of  the  Strassen 
type  and  allows  combinations  of  quantiles  as  well. 
We see that Theorems 4-7 provide improved results with regard to asymptotic
normality of L-statistics and additional results for strong consistency and
the LIL. Furthermore, applications to different situations involving
dependence are straightforward as the relevant theory concerning $\|F_n-F\|$ becomes
available.